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US 60/086,206 
1998-05-21 

US 60/3rO6,834 

19 

uidows Version 3. 




Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\12192001\I2464 51A.raw 

<110> APPLICANT: California Institute of Technology; IS 
Frances H. Arnold ^ 
Hyun Joo 

<120> TITLE OF INVENTION: Oxygenase Enzymes and Screening Method 
<130> FILE REFERENCE: 9373/1E827 -US1 
<140> CURRENT APPLICATION NUMBER: US 09/246, 4 51A 
<141> CURRENT FILING DATE: 1999-02-09 
<150> PRIOR APPLICATION NUMBER: US 60/094,403 
<151> PRIOR FILING DATE: 1998-07-28 
<150> PRIOR APPLICATION NUMBER: US 60/106,840 
<151> PRIOR FILING DATE: 1998-11-03 
<150> PRIOR APPLICATION NUMBER 
<151> PRIOR FILING DATE 
<150> PRIOR APPLICATION NUMBER 
<151> PRIOR FILING DATE: 19, 
<160> NUMBER OF SEQ ID NOS 
<170> SOFTWARE: FastSEQ for 
<210> SEQ ID NO: 1 ^ 



<211> LENGTH: 1402 « 
<212> TYPE: DNA 
<213> ORGANISM: P. Putida 
<400> SEQUENCE: 1 

ctgcaggatc gttatccgct ggccgatctg atcacccagc gtttttccat cgacgaggcc 60 

agcaaggcac ttgaactggt caaggcagga gcactgatca aacccgtgat cgactccact 120 

ctttagccaa cccgcgttcc aggagaacaa caacaatgac gactgaaacc atacaaagca 180 

acgccaatct tgcccctctg ccaccccatg tgccagagca cctggtattc gacttcgaca 240 

tgtacaatcc gtcgaatctg tctgccggcg tgcaggaggc ctgggcagtt ctgcaagaat 300 

caaacgtacc ggatctggtg tggactcgct gcaacggcgg acactggatc gccactcgcg 360 

gccaactgat ccgtgaggcc tatgaagatt accgccactt ttccagcgag tgcccgttca 420 

tccctcgtga agccggcgaa gcctacgact tcattcccac ctcgatggat ccgcccgagc 4 80 

agcgccagtt tcgtgcgctg gccaaccaag tggttggcat gccggtggtg gataagctgg 540 

agaaccggat ccaggagctg gcctgctcgc tgatcgagag cctgcgcccg caaggacagt 600 

gcaacttcac cgaggactac gccgaaccct tcccgatacg catcttcatg ctgctcgcag 660 

gtctaccgga agaagatatc ccgcacttga aatacctaac ggatcagatc acccgtccgg 720 

atggcagcat gaccttcgca gaggccaagg aggcgctcta cgactatctg ataccgatca 7 80 

tcgagcaacg caggcagaag ccgggaaccg acgctatcag catcgttgcc aacggccagg 840 

tcaatgggcg accgatcacc agtgacgaag ccaagaggat gtgtggcctg ttactggtcg 900 

gcggcctgga tacggtggtc aatttcctca gcttcagcat ggagttcctg gccaaaagcc 960 

cggagcatcg ccaggagctg atcgagcgtc ccgagcgtat tccagccgct tgcgaggaac 1020 

tactccggcg cttctcgctg gttgccgatg gccgcatcct cacctccgat tacgagtttc 1080 

atggcgtgca actgaagaaa ggtgaccaga tcctgctacc gcagatgctg tctggcctgg 114 0 

atgagcgcga aaacgcctgc ccgatgcacg tcgacttcag tcgccaaaag gtttcacaca 1200 

ccacctttgg ccacggcagc catctgtgcc ttggccagca cctggcccgc cgggaaatca 1260 

tcgtcaccct caaggaatgg ctgaccagga ttcctgactt ctccattgcc ccgggtgccc 1320 

agattcagca caagagcggc atcgtcagcg gcgtgcaggc actccctctg gtctgggatc 1380 

cggcgactac caaagcggta ta 1402 
<210> SEQ ID NO: 2 ' 
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64 <211> LENGTH: 414 ^ 

65 <212> TYPE: PRT 

66 <213> ORGANISM: P. Putida 
68 <400> SEQUENCE: 2 



69 


Thr 


Thr 


Glu 


Thr 


He 


Gin 


Ser 


Asn 


Ala 


Asn 


Leu 


Ala 


Pro 


Leu 


Pro 


Pro 


70 


1 








5 










10 










15 




71 


His 


Val 


Pro 


Glu 


His 


Leu 


Val 


Phe 


Asp 


Pne 


Asp 


Met 


Tyr 


Asn 


Pro 


Ser 


72 








20 










25 










30 






73 


Asn 


Leu 


Ser 


Ala 


Gly 


Val 


Gin 


Glu 


Ala 


Trp 


Ala 


Val 


Leu 


Gin 


Glu 


Ser 


74 






35 










40 










45 








75 


Asn 


Val 


Pro 


Asp 


Leu 


Val 


Trp 


Thr 


Arg 


Cys 


Asn 


Gly 


Gly 


His 


Trp 


He 


76 




50 










55 










C A 










77 


Ala 


Thr 


Arg 


Gly 


Gin 


Leu 


He 


Arg 


Glu 


Ala 


Tyr 


Glu 


Asp 


Tyr 


Arg 


His 


78 


65 










70 










75 










O A 

80 


79 


Phe 


Ser 


Ser 


Glu 


Cys 


Pro 


Phe 


He 


Pro 


Arg 


Glu 


Ala 


Gly 


Glu 


Ala 


Tyr 


80 










85 










a a 

90 










95 




81 


Asp 


Phe 


He 


Pro 


Thr 


Ser 


Met 


Asp 


Pro 


Pro 


Glu 


Gin 


Arg 


Gin 


Phe 


Arg 


82 








100 










1 A c 

105 










1 1 A 
110 






83 


Ala 


Leu 


Ala 


Asn 


Gin 


Val 


Val 


Gly 


Met 


Pro 


Val 


Val 


Asp 


Lys 


Leu 


Glu 


O A 

o4 






lie 
115 










1 OA 










Iz J 








85 


Asn 


Arg 


He 


Gin 


Glu 


Leu 


Ala 


Cys 


Ser 


Leu 


He 


Glu 


Ser 


Leu 


Arg 


Pro 


86 




130 










135 










140 










87 


Gin 


Gly 


Gin 


Cys 


Asn 


Phe 


Thr 


Glu 


Asp 


Tyr 


Ala 


Glu 


Pro 


Phe 


Pro 


He 


88 


145 










150 










155 










160 


89 


Arg 


He 


Phe 


Met 


Leu 


Leu 


Ala 


Gly 


Leu 


Pro 


Glu 


Glu 


Asp 


He 


Pro 


His 


90 










165 










170 










175 




91 


Leu 


Lys 


Tyr 


Leu 


Thr 


Asp 


Gin 


Met 


Thr 


Arg 


Pro 


Asp 


Gly 


Ser 


Met 


Thr 


92 








180 










185 










190 






93 


Phe 


Ala 


Glu 


Ala 


Lys 


Glu 


Ala 


Leu 


Tyr 


Asp 


Tyr 


Leu 


He 


Pro 


He 


He 


94 






195 










200 










205 








95 


Glu 


Gin 


Arg 


Arg 


Gin 


Lys 


Pro 


Gly 


Thr 


Asp 


Ala 


He 


Ser 


He 


Val 


Ala 


96 




210 










215 










220 










97 


Asn 


Gly 


Gin 


Val 


Asn 


Gly 


Arg 


Pro 


He 


Thr 


Ser 


Asp 


Glu 


Ala 


Lys 


Arg 


98 


225 










230 










235 










240 


99 


Met 


Cys 


Gly 


Leu 


Leu 


Leu 


Val 


Gly 


Gly 


Leu 


Asp 


Thr 


Val 


Val 


Asn 


Phe 



100. 245 250 255 

101 Leu Ser Phe Ser Met Glu Phe Leu Ala Lys Ser Pro Glu His Arg Gin 

102 260 265 270 

103 Glu Leu He Glu Arg Pro Glu Arg He Pro Ala Ala Cys Glu Glu Leu 

104 275 280 285 

105 Leu Arg Arg Phe Ser Leu Val Ala Asp Gly Arg lie Leu Thr Ser Asp 

106 290 295 300 

107 Tyr Glu Phe His Gly Val Gin Leu Lys Lys Gly Asp Gin He Leu Leu 

108 305 310 315 320 

109 Pro Gin Met Leu Ser Gly Leu Asp Glu Arg Glu Asn Ala Cys Pro Met 

110 325 330 335 

111 His Val Asp Phe Ser Arg Gin Lys Val Ser His Thr Thr Phe Gly His 

112 340 345 350 

113 Gly Ser His Leu Cys Leu Gly Gin His Leu Ala Arg Arg Glu He He 
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114 355 360 365 

115 Val Thr Leu Lys Glu Trp Leu Thr Arg lie Pro Asp Phe Ser lie Ala 

116 370 375 380 

117 Pro Gly Ala Gin He Gin His Lys Ser Gly He Val Ser Gly Val Gin 

118 385 390 395 400 

119 Ala Leu Pro Leu Val Trp Asp Pro Ala Thr Thr Lys Ala Val 

120 405 410 

122 <210> SEQ ID NO: 3 

123 <211> LENGTH: 31 

124 <212> TYPE: DNA 

125 <213> ORGANISM: Artificial Sequence 

127 <220> FEATURE: 

128 <223> OTHER INFORMATION: Primer sequence 

130 <400> SEQUENCE: 3 

131 gatcatgaat gagaccgaca catcacctat c 

133 <210> SEQ ID NO: 4 

134 <211> LENGTH: 32 

135 <212> TYPE: DNA 

136 <213> ORGANISM: Artificial Sequence 

13 8 <220> FEATURE: 

139 <223> OTHER INFORMATION: Primer sequence 
141 <400> SEQUENCE: 4 

14 2 acgaattcta gaagaagaaa ctgaggttat tg 

144 <210> SEQ ID NO: 5 

145 <211> LENGTH: 27 

146 <212> TYPE: DNA 

147 <213> ORGANISM: Artificial Sequence 
14 9 <220> FEATURE: 

150 <223> OTHER INFORMATION: Primer sequence 

152 <400> SEQUENCE: 5 

153 ttggatccgg tggaccttgt ccatttg 

155 <210> SEQ ID NO: 6 

156 <211> LENGTH: 29 

157 <212> TYPE: DNA 

158 <213> ORGANISM: Artificial Sequence 

160 <220> FEATURE: 

161 <223> OTHER INFORMATION: Primer sequence 

163 <400> SEQUENCE: 6 

164 gctctagatc aaccgaagtg cttgtcgag 

166 <210> SEQ ID NO: 7 

167 <211> LENGTH: 24 

168 <212> TYPE: DNA 

169 <213> ORGANISM: Artificial Sequence 

171 <220> FEATURE: 

172 <223> OTHER INFORMATION: Primer sequence 

174 <400> SEQUENCE: 7 

175 cggaattcta ggaaacagac catg 

177 <210> SEQ ID NO: 8 

178 <211> LENGTH: 29 
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179 <212> TYPE: DNA 

180 <213> ORGANISM: Artificial Sequence 

182 <22 0> FEATURE: 

183 <223> OTHER INFORMATION: Primer sequence 

185 <400> SEQUENCE: 8 

186 ccggatccaa cctgggtcga agtcaaatg 29 

188 <210> SEQ ID NO: 9 

189 <211> LENGTH: 24 

190 <212> TYPE: DNA 

191 <213> ORGANISM: Artificial Sequence 

193 <220> FEATURE: 

194 <223> OTHER INFORMATION: Primer sequence 

196 <400> SEQUENCE: 9 

197 catcgatgct taggaggtca tatg 24 

199 <210> SEQ ID NO: 10 

200 <211> LENGTH: 25 

201 <212> TYPE: DNA 

202 <213> ORGANISM: Artificial Sequence 

204 <220> FEATURE: 

205 <223> OTHER INFORMATION: Primer sequence 

207 <400> SEQUENCE: 10 

208 tcatgtttga cagcttatca tcgat 25 

210 <210> SEQ ID NO: 11 

211 <211> LENGTH: 414 

212 <212> TYPE: PRT 

213 <213> ORGANISM: Artificial Sequence 

215 <220> FEATURE: 

216 <223> OTHER INFORMATION: Mutant M7-4H 
218 <400> SEQUENCE: 11 



219 


Thr 


Thr 


Glu 


Thr 


He 


Gin 


Ser 


Asn 


Ala 


Asn 


Leu 


Ala 


Pro 


Leu 


Pro 


Pro 


220 


1 








5 










10 










15 




221 


His 


Val 


Pro 


Glu 


His 


Leu 


Val 


Phe 


Asp 


Phe 


Asp 


Met 


Tyr 


Asn 


Pro 


Ser 


222 








20 










25 










30 






223 


Asn 


Leu 


Ser 


Ala 


Gly 


Val 


Gin 


Glu 


Ala 


Trp 


Ala 


Val 


Leu 


Gin 


Glu 


Ser 


224 






35 










40 










45 








225 


Asn 


Val 


Pro 


Asp 


Leu 


Val 


Trp 


Thr 


Arg 


Cys 


Asn 


Gly 


Gly 


His 


Trp 


He 


226 




50 










55 










60 










227 


Ala 


Thr 


Arg 


Gly 


Gin 


Leu 


He 


Arg 


Glu 


Ala 


Tyr 


Glu 


A sp 


Tyr 


Arg 


His 


228 


65 










70 










75 










80 


229 


Phe 


Ser 


Ser 


Glu 


Cys 


Pro 


Phe 


He 


Pro 


Arg 


Glu 


Ala 


Gly 


Glu 


Ala 


Tyr 


230 










85 










90 










95 




231 


Asp 


Phe 


He 


Pro 


Thr 


Ser 


Met 


Asp 


Pro 


Pro 


Glu 


Gin 


Arg 


Gin 


Phe 


Arg 


232 








100 










105 










110 






233 


Ala 


Leu 


Ala 


Asn 


Gin 


Val 


Val 


Gly 


Met 


Pro 


Val 


Val 


Asp 


Lys 


Leu 


Glu 


234 






115 










120 










125 








235 


Asn 


Arg 


He 


Gin 


Glu 


Leu 


Ala 


Cys 


Ser 


Leu 


He 


Glu 


Ser 


Leu 


Arg 


Pro 


236 




130 










135 










140 










237 


Gin 


Gly 


Gin 


Cys 


Asn 


Phe 


Thr 


Glu 


Asp 


Tyr 


Ala 


Glu 


Pro 


Phe 


Pro 


He 



238 145 150 155 160 
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239 


Arg 


lie Phe 


Met 


Leu 


Leu 


Ala 


Gly 


Leu 


Pro 


Glu 


CjlU 


Asp 


lie 


Pro 


HIS 


240 








165 










1 /U 










1 / D 




241 


Leu 


Lys Tyr 


Leu 


Thr 


Asp 


p 1 « 

Gin 


Met 


Thr 


Arg 


Pro 


Asp 


biy 


Ser 


Mer. 


Thr 


242 






180 










1 Q C 

185 










1 QA 

iy u 






243 


Phe 


Ala Glu 


Ala 


Lys 


Glu 


Ala 


Leu 


Tyr 


Asp 


Tyr 


Leu 


lie 


Pro 


lie 


Tl Q 

lie 


244 




195 










zUU 










O ft ^ 








245 


Glu 


Gin Arg 


Arg 


Gin 


Lys 


Pro 


Gly 


Thr 


Asp 


Ala 


lie 


Ser 


Tl fl 

lie 


y a i 

vai 


Aia 


246 




210 








215 










220 










247 


Asn 


Gly Gin 


Val 


Asn 


Gly 


Arg 


Pro 


lie 


Thr 


Ser 


Asp 


blU 


7V 1 — 

Ala 


Lys 


Arg 


248 


225 








230 










235 










O A f\ 

z4U 


249 


Met 


Cys Gly 


Leu 


Leu 


Leu 


Val 


Gly 


Gly 


Leu 


Asp 


Thr 


Val 


Val 


Asn 


Phe 


250 








245- 










250 










ZJJ 




251 


Leu 


Ser Phe 


Ser 


Met 


Glu 


Phe 


Leu 


Ala 


Lys 


Ser 


Pro 


blU 


His 


Arg 


bin 


252 






260 










1 c c 

2d j 










T7 A 

z / u 






253 


Glu 


Leu lie 


Glu 


Arg 


Pro 


Glu 


Arg 


lie 


Pro 


Ala 


Ala 


Cys 


GlU 


GlU 


Leu 


254 




275 










'l O A 

2 80 










ZOJ 








255 


Leu 


Arg Arg 


Phe 


Ser 


Leu 


Val 


Ala 


Asp 


Gly 


Arg 


He 


Leu 


Thr 


Ser 


Asp 


256 




290 








one 

295 










300 










257 


Tyr 


Glu Phe 


His 


Gly Val 


Gin 


Leu 


Lys 


Lys 


Gly Asp 


p 1 « 

Gin 


He 


Leu 


Leu 


258 


305 








310 










315 












259 


Pro 


Gin Met 


Leu 


Ser 


Gly 


Leu 


Asp 


Glu 


Arg 


Lys 


Asn 


Ala 

Aia 


Cys 


Pro 


M6l 


260 








325 










j JU 










335 




261 


His 


Val Asp 


Phe 


Ser 


Arg 


Gin 


Lys 


val 


Ser 


His 


Thr 


Thr 


Phe 


Gly 


Ui o 

HIS 


262 






340 










j4 j 










350 






263 


Gly 


Ser His 


Leu 


Cys 


Leu 


Gly 


Gin 


His 


Leu 


Ala 


Arg 


Arg 


Glu 


He 


Tl A 

lie 


264 




355 










360 










i e c: 








265 


Val 


Thr Leu 


Lys 


Glu 


Trp 


Leu 


Thr 


Arg 


He 


Pro 


Asp 


Fne 


Ser 


He 


Aia 


266 




370 








375 










380 










267 


Pro Gly Ala 


Gin 


He 


Gin 


His 


Lys 


Ser 


Gly 


He 


Val 


Ser 


Gly Val 


bin 


268 


385 








390 










395 










/inn 


269 


Ala 


Leu Pro 


Leu 


Val 


Trp 


Asp 


Pro 


Ala 


Thr 


Thr 


Lys 


Ala 


Val 






270 








405 










410 














272 


<210> 


• SEQ ID 


NO: 


12 
























273 


<211> 


- LENGTH: 


: 414 
























274 


<212> TYPE: PRT 


























275 


<213> 


• ORGANISM: Artificial Sequence 
















277 


<220> 


■ FEATURE: 


























278 


<223> 


- OTHER INFORMATION: 


Mutant M7-6H 














280 


<400> 


• SEQUENCE: 12 
























281 


Thr 


Thr Glu 


Thr 


He 


Gin 


Ser 


Asn 


Ala 


Asn 


Leu 


Ala 


Pro 


Leu 


Pro 


Pro 


282 


1 






5 










10 










15 




283 


His 


Val Pro 


Glu 


His 


Leu 


Val 


Phe 


Asp 


Phe 


Asp 


Met 


Tyr 


Asn 


Pro 


Ser 


284 






20 










25 










30 






285 


Asn 


Leu Ser 


Ala 


Gly Val 


Gin 


Glu 


Ala 


Trp 


Ala 


Val 


Leu 


Gin 


Glu 


Ser 


286 




35 










40 










45 








287 


Asn 


Val Pro 


Asp 


Leu 


Val 


Trp 


Thr 


Arg 


Cys 


Asn 


Gly Gly 


His 


Trp 


He 


288 




50 








55 










60 










289 


Ala 


Thr Arg Gly Gin 


Leu 


He 


Arg 


Glu 


Ala 


Tyr 


Glu 


Asp 


Tyr 


Arg 


His 


290 


65 








70 










75 










80 
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